# Cross-language Support

Vit Giantopt Patch16 Siglip 256.v2 Webli
Apache-2.0
Vision Transformer model based on SigLIP 2 technology, focused on image feature extraction
Text-to-Image Transformers
V
timm
59
0
Outetts 0.3 1B GGUF
OuteTTS-0.3-1B is a multilingual text-to-speech model developed by OuteAI, supporting English, Chinese, Japanese, Korean, French, and German.
Speech Synthesis Supports Multiple Languages
O
gaianet
34
0
Outetts 0.3 1B GGUF
OuteTTS-0.3-1B is a multilingual text-to-speech model developed by OuteAI and quantized by Second State Inc.
Speech Synthesis Supports Multiple Languages
O
second-state
151
1
Outetts 0.3 500M GGUF
OuteTTS-0.3-500M is a multilingual text-to-speech model developed by OuteAI and released under the cc-by-nc-4.0 license.
Speech Synthesis Supports Multiple Languages
O
gaianet
79
0
Speechless Llama3.2 V0.1
Apache-2.0
Speechless is a compact open-source text-to-semantic model (1 billion parameters) designed to directly convert audio into discrete semantic tokens without relying on traditional text-to-speech (TTS) models.
Speech Recognition Supports Multiple Languages
S
Menlo
39
3
Speechless Llama3.2 V0.1
Apache-2.0
Speechless is a compact open-source text-to-semantic model (1 billion parameters) designed to directly convert audio into discrete semantic representation tokens without relying on traditional text-to-speech (TTS) models.
Speech Synthesis Supports Multiple Languages
S
homebrewltd
28
3
Outetts 0.2 500M GGUF
OuteTTS-0.2-500M is a multilingual text-to-speech model developed by OuteAI, supporting English, Chinese, Japanese, and Korean.
Speech Synthesis Supports Multiple Languages
O
gaianet
44
0
GPT SoVITS V1 Base
MIT
GPT-SoVITS (V1) is a multilingual text-to-speech foundation model supporting Chinese, English, and Japanese.
Speech Synthesis Supports Multiple Languages
G
None1145
20
1
Whisper Large V3 Gguf
Apache-2.0
Whisper is a multilingual automatic speech recognition (ASR) system that supports speech-to-text tasks in multiple languages.
Speech Recognition Supports Multiple Languages
W
vonjack
931
14
Xlm Roberta Base Language Detection ONNX
A multilingual detection model based on XLM-RoBERTa, capable of identifying the language category of text.
Text Classification Transformers
X
Oblix
16
1
Faster Whisper Large V2
MIT
Whisper large-v2 is a large-scale automatic speech recognition (ASR) model developed by OpenAI, supporting multilingual speech-to-text tasks.
Speech Recognition Supports Multiple Languages
F
Systran
948.29k
34
Sbert All MiniLM L6 With Pooler
Apache-2.0
This is an ONNX-converted model based on sentence-transformers/all-MiniLM-L6-v2, capable of mapping sentences and paragraphs into a 384-dimensional dense vector space, suitable for tasks like clustering or semantic search.
Text Embedding Transformers English
S
vamsibanda
28
0
Zabanshenas Roberta Base Mix
Apache-2.0
Zabanshenas is a Transformer-based solution for identifying the most probable language of written documents/text.
Text Classification Transformers Supports Multiple Languages
Z
m3hrdadfi
23
10
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase